Compression of Topological Fragment Spectra (TFS) for Accelerating Chemical Data Mining
نویسندگان
چکیده
منابع مشابه
Longest Fragment First Algorithms for Data Compression
On{line text{compression algorithms are considered, where compression is done by substituting substrings of the text according to some xed dictionary (code book). Due to the long running time of optimal compression algorithms, several on{line heuristics have been introduced in the literature. In this paper we analyse two modied version of an old algorithm introduced by Shuegraf and Heaps [8]. W...
متن کاملAccelerating Lossless Data Compression with GPUs
Huffman compression is a statistical, lossless, data compression algorithm that compresses data by assigning variable length codes to symbols, with the more frequently appearing symbols given shorter codes than the less. This work is a modification of the Huffman algorithm which permits uncompressed data to be decomposed into independently compressible and decompressible blocks, allowing for co...
متن کاملthe clustering and classification data mining techniques in insurance fraud detection:the case of iranian car insurance
با توجه به گسترش روز افزون تقلب در حوزه بیمه به خصوص در بخش بیمه اتومبیل و تبعات منفی آن برای شرکت های بیمه، به کارگیری روش های مناسب و کارآمد به منظور شناسایی و کشف تقلب در این حوزه امری ضروری است. درک الگوی موجود در داده های مربوط به مطالبات گزارش شده گذشته می تواند در کشف واقعی یا غیرواقعی بودن ادعای خسارت، مفید باشد. یکی از متداول ترین و پرکاربردترین راه های کشف الگوی داده ها استفاده از ر...
Mining a Chemical Database for Fragment Co-occurrence: Discovery of "Chemical Clichés"
Nowadays millions of different compounds are known, their structures stored in electronic databases. Analysis of these data could yield valuable insights into the laws of chemistry and the habits of chemists. We have therefore explored the public database of the National Cancer Institute (>250,000 compounds) by pattern searching. We split the molecules of this database into fragments to find ou...
متن کاملdata mining rules and classification methods in insurance: the case of collision insurance
assigning premium to the insurance contract in iran mostly has based on some old rules have been authorized by government, in such a situation predicting premium by analyzing database and it’s characteristics will be definitely such a big mistake. therefore the most beneficial information one can gathered from these data is the amount of loss happens during one contract to predicting insurance ...
15 صفحه اولذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Computer Chemistry, Japan
سال: 2012
ISSN: 1347-1767,1347-3824
DOI: 10.2477/jccj.2012-0002